21 research outputs found

    Predicting the Next Best View for 3D Mesh Refinement

    3D reconstruction is a core task in many applications such as robot navigation or site inspection. Finding the best poses from which to capture part of the scene is one of the most challenging problems in the field, and it goes under the name of Next Best View. Recently, many volumetric methods have been proposed: they choose the Next Best View by reasoning over a 3D voxelized space and finding the pose that minimizes the uncertainty encoded in the voxels. Such methods are effective, but they do not scale well, since the underlying representation requires a huge amount of memory. In this paper we propose a novel mesh-based approach which focuses on the worst reconstructed regions of the environment mesh. We define a photo-consistent index to evaluate the accuracy of the 3D mesh, and an energy function over the worst regions of the mesh which takes into account the mutual parallax with respect to the previous cameras, the angle of incidence of the viewing ray to the surface, and the visibility of the region. We test our approach on a well-known dataset and achieve state-of-the-art results.
    Comment: 13 pages, 5 figures, to be published in IAS-1
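    As a rough illustration of the kind of scoring such an energy function might perform, the sketch below rates a candidate camera pose for a single poorly reconstructed mesh region using parallax, incidence, and visibility cues. The terms, weights, and names are illustrative assumptions, not the paper's actual formulation.

```python
# Toy Next-Best-View score in the spirit described above (higher is better).
# All terms and weights are assumptions, not the paper's energy function.
import numpy as np

def nbv_score(candidate_pos, region_center, region_normal,
              previous_cams, visible, w=(1.0, 1.0)):
    """Score one candidate camera for one worst-reconstructed mesh region.

    candidate_pos : (3,) candidate camera position
    region_center : (3,) centroid of the region
    region_normal : (3,) unit outward surface normal of the region
    previous_cams : (N, 3) positions of cameras already used
    visible       : bool, whether the region is visible from the candidate
    """
    if not visible:
        return -np.inf  # an occluded region contributes nothing

    view_dir = region_center - candidate_pos
    view_dir /= np.linalg.norm(view_dir)

    # Incidence term: prefer viewing rays close to the surface normal.
    incidence = max(0.0, float(-view_dir @ region_normal))

    # Parallax term: reward triangulation angle w.r.t. the previous cameras.
    to_prev = previous_cams - region_center
    to_prev /= np.linalg.norm(to_prev, axis=1, keepdims=True)
    to_cand = candidate_pos - region_center
    to_cand /= np.linalg.norm(to_cand)
    parallax = float(np.mean(np.arccos(np.clip(to_prev @ to_cand, -1.0, 1.0))))

    return w[0] * incidence + w[1] * parallax
```

    In a full pipeline one would evaluate this score for every sampled candidate pose over the worst mesh regions and pick the maximizer; the weights trade off baseline against viewing angle.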

    Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds

    Humans can robustly recognize and localize objects by integrating visual and auditory cues. While machines can now do the same with images, less work has been done with sounds. This work develops an approach for dense semantic labelling of sound-making objects, purely based on binaural sounds. We propose a novel sensor setup and record a new audio-visual dataset of street scenes with eight professional binaural microphones and a 360-degree camera. The co-existence of visual and audio cues is leveraged for supervision transfer. In particular, we employ a cross-modal distillation framework that consists of a vision 'teacher' method and a sound 'student' method: the student method is trained to generate the same results as the teacher method. This way, the auditory system can be trained without using human annotations. We also propose two auxiliary tasks, namely a) a novel task of Spatial Sound Super-resolution, which increases the spatial resolution of sounds, and b) dense depth prediction of the scene. We then formulate the three tasks into one end-to-end trainable multi-tasking network that aims to boost the overall performance. Experimental results on the dataset show that 1) our method achieves promising results for semantic prediction and the two auxiliary tasks; 2) the three tasks are mutually beneficial, with joint training achieving the best performance; and 3) the number and orientations of the microphones are both important. The data and code will be released to facilitate research in this new direction.
    Comment: Project page: https://www.trace.ethz.ch/publications/2020/sound_perception/index.htm
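    The core supervision-transfer idea, a frozen vision teacher supervising a sound student, can be sketched with a standard distillation loss. The networks, tensor shapes, and temperature below are placeholders, not the paper's architecture:

```python
# Minimal cross-modal distillation sketch: a frozen vision "teacher"
# supervises a binaural-sound "student", so no human labels are needed.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, T=1.0):
    """KL divergence between softened teacher and student class predictions."""
    p_teacher = F.softmax(teacher_logits / T, dim=1)
    log_p_student = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T * T

# Hypothetical stand-ins producing per-pixel semantic logits (B, C, H, W).
teacher = torch.nn.Conv2d(3, 8, 1)   # placeholder for a pretrained vision model
student = torch.nn.Conv2d(1, 8, 1)   # placeholder for the sound model

image = torch.randn(2, 3, 64, 64)        # RGB frames seen by the teacher
spectrogram = torch.randn(2, 1, 64, 64)  # binaural spectrogram features

with torch.no_grad():                 # the teacher is frozen
    t_logits = teacher(image)
loss = distillation_loss(student(spectrogram), t_logits)
loss.backward()                       # gradients flow only into the student
```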

    The Lung Screen Uptake Trial (LSUT): protocol for a randomised controlled demonstration lung cancer screening pilot testing a targeted invitation strategy for high risk and ‘hard-to-reach’ patients

    Background: Participation in low-dose CT (LDCT) lung cancer screening offered in the trial context has been poor, especially among smokers from socioeconomically deprived backgrounds, a group for whom the risk-benefit ratio is improved due to their high risk of lung cancer. Attracting high-risk participants is essential to the success and equity of any future screening programme. This study will investigate whether the observed low and biased uptake of screening can be improved using a targeted invitation strategy.
    Methods/design: A randomised controlled trial design will be used to test whether targeted invitation materials are effective at improving engagement with an offer of lung cancer screening for high-risk candidates. Two thousand patients aged 60–75, recorded as smokers within the last five years by their GP, will be identified from primary care records and individually randomised to receive either intervention invitation materials (which take a targeted, stepped and low-burden approach to information provision prior to the appointment) or control invitation materials. The primary outcome is uptake of a nurse-led 'lung health check' hospital appointment, during which patients will be offered a spirometry test, an exhaled carbon monoxide (CO) reading, and an LDCT if eligible. Initial data on demographics (i.e. age, sex, ethnicity, deprivation score) and smoking status will be collected in primary care and analysed to explore differences between attenders and non-attenders with respect to invitation group. Those who attend the lung health check will have further data on smoking collected during their appointment (including pack-year history, nicotine dependence and confidence to quit). Secondary outcomes will include willingness to be screened, uptake of LDCT, and measures of informed decision-making to ensure the latter is not compromised by either invitation strategy.
    Discussion: If effective at improving informed uptake of screening and reducing bias in participation, this invitation strategy could be adopted by local screening pilots or a national programme.
    Trial registration: This study was registered with the ISRCTN (International Standard Registered Clinical/soCial sTudy Number: ISRCTN21774741) on 23 September 2015 and the NIH ClinicalTrials.gov database (NCT0255810) on 22 September 2015.

    A Benchmark Comparison of Monocular Visual-Inertial Odometry Algorithms for Flying Robots

    Flying robots require a combination of accuracy and low latency in their state estimation in order to achieve stable and robust flight. However, due to the power and payload constraints of aerial platforms, state estimation algorithms must provide these qualities under the computational constraints of embedded hardware. Cameras and inertial measurement units (IMUs) satisfy these power and payload constraints, so visual-inertial odometry (VIO) algorithms are popular choices for state estimation in these scenarios, in addition to their ability to operate without external localization from motion capture or global positioning systems. It is not clear from existing results in the literature, however, which VIO algorithms perform well under the accuracy, latency, and computational constraints of a flying robot with onboard state estimation. This paper evaluates an array of publicly available VIO pipelines (MSCKF, OKVIS, ROVIO, VINS-Mono, SVO+MSF, and SVO+GTSAM) on different hardware configurations, including several single-board computer systems that are typically found on flying robots. The evaluation considers the pose estimation accuracy, per-frame processing time, and CPU and memory load while processing the EuRoC datasets, which contain six degree-of-freedom (6DoF) trajectories typical of flying robots. We present our complete results as a benchmark for the research community.
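    For readers unfamiliar with how pose estimation accuracy is typically scored in such benchmarks, the sketch below computes the absolute trajectory error (ATE) as the RMSE of translational error over time-aligned positions. The trajectory-alignment step (e.g. an SE(3)/Umeyama registration) that benchmarks normally apply first is omitted for brevity, and the example data are synthetic.

```python
# Sketch of one common VIO accuracy metric: absolute trajectory error (ATE),
# computed as the RMSE of position error at matched timestamps.
import numpy as np

def ate_rmse(estimated, ground_truth):
    """estimated, ground_truth : (N, 3) positions sampled at matched timestamps."""
    err = estimated - ground_truth
    return float(np.sqrt(np.mean(np.sum(err**2, axis=1))))

# Hypothetical usage with a noisy copy of a straight-line trajectory:
gt = np.linspace([0.0, 0.0, 0.0], [10.0, 0.0, 1.0], 100)
est = gt + 0.05 * np.random.randn(*gt.shape)
print(f"ATE RMSE: {ate_rmse(est, gt):.3f} m")
```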

    AirTouch: Interacting With Computer Systems At A Distance

    We present AirTouch, a new vision-based interaction system. AirTouch uses computer vision techniques to extend commonly used interaction metaphors, such as multitouch screens, while removing any need to physically touch the display. The user interacts with a virtual plane that rests between the user and the display. On this plane, hands and fingers are tracked and gestures are recognized in a manner similar to a multitouch surface. Many other vision- and gesture-based human-computer interaction systems presented in the literature have been limited by requirements that users not leave the frame or not perform gestures accidentally, as well as by cost or specialized equipment. AirTouch does not suffer from these drawbacks. Instead, it is robust, easy to use, builds on a familiar interaction paradigm, and can be implemented using a single camera with off-the-shelf equipment such as a webcam-enabled laptop. In order to maintain usability and accessibility while minimizing cost, we present a set of basic AirTouch guidelines. We have developed two interfaces using these guidelines: one for general computer interaction, and one for searching an image database. We present the workings of these systems along with observational results regarding their usability.
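    The virtual-plane metaphor can be sketched as a signed-distance test against a plane floating in front of the display. The plane placement, touch threshold, and names below are illustrative assumptions, not AirTouch's actual implementation.

```python
# Toy sketch of a virtual touch plane: a tracked fingertip "touches" when it
# pushes through a plane floating between the user and the display.
import numpy as np

PLANE_POINT = np.array([0.0, 0.0, 0.5])   # a point on the virtual plane (metres)
PLANE_NORMAL = np.array([0.0, 0.0, 1.0])  # plane faces the user along +z
TOUCH_EPS = 0.01                          # 1 cm dead zone to suppress jitter

def plane_distance(fingertip):
    """Signed distance of a 3D fingertip position to the plane;
    negative means the finger has crossed through it."""
    return float((fingertip - PLANE_POINT) @ PLANE_NORMAL)

def is_touching(fingertip):
    return plane_distance(fingertip) < -TOUCH_EPS

# Example: a fingertip approaching and then crossing the plane.
for z in (0.60, 0.52, 0.49, 0.47):
    tip = np.array([0.1, 0.0, z])
    print(f"z={z:.2f}m ->", "touch" if is_touching(tip) else "hover")
```

    Once a touch is registered, the fingertip's in-plane coordinates can be mapped to screen coordinates, recovering the familiar multitouch event model without physical contact.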